================= CRYOSPARCW ======= 2021-07-26 14:44:11.983040 ========= Project P17 Job J773 Master jptitan Port 39002 =========================================================================== ========= monitor process now starting main process MAINPROCESS PID 334849 MAIN PID 334849 refine.newrun cryosparc_compute.jobs.jobregister ========= monitor process now waiting for main process ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat *************************************************************** Running job J773 of type nonuniform_refine_new Running job on hostname %s jptitan Allocated Resources : {'fixed': {'SSD': True}, 'hostname': 'jptitan', 'lane': 'default', 'lane_type': 'default', 'license': True, 'licenses_acquired': 1, 'slots': {'CPU': [0, 1, 2, 3], 'GPU': [0], 'RAM': [0, 1, 2]}, 'target': {'cache_path': '/scratch', 'cache_quota_mb': None, 'cache_reserve_mb': 10000, 'desc': None, 'gpus': [{'id': 0, 'mem': 11554717696, 'name': 'GeForce RTX 2080 Ti'}, {'id': 1, 'mem': 11554717696, 'name': 'GeForce RTX 2080 Ti'}, {'id': 2, 'mem': 11554324480, 'name': 'GeForce RTX 2080 Ti'}], 'hostname': 'jptitan', 'lane': 'default', 'monitor_port': None, 'name': 'jptitan', 'resource_fixed': {'SSD': True}, 'resource_slots': {'CPU': [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63], 'GPU': [0, 1, 2], 'RAM': [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15]}, 'ssh_str': 'jparmache@jptitan', 'title': 'Worker node jptitan', 'type': 'node', 'worker_bin_path': '/data/software/cryosparc/cryosparc2_worker/bin/cryosparcw'}} HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size ========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12)========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (342, 1, 2007, 81) 218 block size 256 grid size (342, 126, 1) global compute_resid_pow with (342, 1, 672, 44) 894 block size 256 grid size (342, 42, 1) global compute_resid_pow with (342, 1, 224, 24) 2362 block size 256 grid size (342, 14, 1) global compute_resid_pow with (342, 1, 80, 12) 2362 block size 256 grid size (342, 5, 1) global compute_resid_pow with (342, 1, 32, 8) 2362 block size 256 grid size (342, 2, 1) global compute_resid_pow with (342, 1, 16, 4) 2362 block size 256 grid size (342, 1, 1) global compute_resid_pow with (342, 1, 8, 4) 2362 block size 128 grid size (342, 8, 1) global compute_resid_pow with (342, 1, 19, 21) 2362 block size 256 grid size (342, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256========= sending heartbeat grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362========= sending heartbeat block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (343, 1, 2007, 81) 218 block size 256 grid size (343, 126, 1) global compute_resid_pow with (343, 1, 672, 44) 894 block size 256 grid size (343, 42, 1) global compute_resid_pow with (343, 1, 224, 24) 2362 block size 256 grid size (343, 14, 1) global compute_resid_pow with (343, 1, 80, 12) 2362 block size 256 grid size (343, 5, 1) global compute_resid_pow with (343, 1, 32, 8) 2362 block size 256 grid size (343, 2, 1) global compute_resid_pow with (343, 1, 16, 4) 2362 block size 256 grid size (343, 1, 1) global compute_resid_pow with (343, 1, 8, 4) 2362 block size 128 grid size (343, 8, 1) global compute_resid_pow with (343, 1, 19, 21) 2362 block size 256 grid size (343, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256========= sending heartbeat grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 2362========= sending heartbeat block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12)========= sending heartbeat 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (342, 1, 2007, 81) 218 block size 256 grid size (342, 126, 1) global compute_resid_pow with (342, 1, 672, 44) 894 block size 256 grid size (342, 42, 1) global compute_resid_pow with (342, 1, 224, 24) 2362 block size 256 grid size (342, 14, 1) global compute_resid_pow with (342, 1, 80, 12) 2362 block size 256 grid size (342, 5, 1) global compute_resid_pow with (342, 1, 32, 8) 2362 block size 256 grid size (342, 2, 1) global compute_resid_pow with (342, 1, 16, 4) 2362 block size 256 grid size (342, 1, 1) global compute_resid_pow with (342, 1, 8, 4) 2362 block size 128 grid size (342, 8, 1) global compute_resid_pow with (342, 1, 19, 21) 2362 block size 256 grid size (342, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256========= sending heartbeat grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362========= sending heartbeat block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (343, 1, 2007, 81) 218 block size 256 grid size (343, 126, 1) global compute_resid_pow with (343, 1, 672, 44) 894 block size 256 grid size (343, 42, 1) global compute_resid_pow with (343, 1, 224, 24) 2362 block size 256 grid size (343, 14, 1) global compute_resid_pow with (343, 1, 80, 12) 2362 block size 256 grid size (343, 5, 1) global compute_resid_pow with (343, 1, 32, 8) 2362 block size 256 grid size (343, 2, 1) global compute_resid_pow with (343, 1, 16, 4) 2362 block size 256 grid size (343, 1, 1) global compute_resid_pow with (343, 1, 8, 4) 2362 block size 128 grid size (343, 8, 1) global compute_resid_pow with (343, 1, 19, 21) 2362 block size 256 grid size (343, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: FSC No-Mask... 0.143 at 78.217 radwn. 0.5 at 46.147 radwn. Took 2.665s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 86.868 radwn. 0.5 at 71.895 radwn. Took 3.120s. FSC Loose Mask... ========= sending heartbeat 0.143 at 94.776 radwn. 0.5 at 79.529 radwn. Took 13.048s. FSC Tight Mask... ========= sending heartbeat 0.143 at 99.314 radwn. 0.5 at 86.729 radwn. Took 10.420s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (215, 1, 2007, 81) 218 block size 256 grid size (215, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (215, 1, 672, 44) 894 block size 256 grid size (215, 42, 1) global compute_resid_pow with (215, 1, 224, 24) 3604 block size 256 grid size (215, 14, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (215, 1, 80, 12) 14456 block size 256 grid size (215, 5, 1) global compute_resid_pow with (215, 1, 32, 8) 15498 block size 256 grid size (215, 2, 1) global compute_resid_pow with (215, 1, 16, 4) 15498 block size 256 grid size (215, 1, 1) global compute_resid_pow with (215, 1, 8, 4) 15498 block size 128 grid size (215, 8, 1) global compute_resid_pow with (215, 1, 19, 21) 15498 block size 256 grid size (215, 2, 1) global compute_resid_pow with (215, 1, 19, 21) 15498 block size 256 grid size (215, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size========= sending heartbeat (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (216, 1, 2007, 81) 218 block size 256 grid size (216, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (216, 1, 672, 44) 894 block size 256 grid size (216, 42, 1) global compute_resid_pow with (216, 1, 224, 24) 3604 block size 256 grid size (216, 14, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (216, 1, 80, 12) 14456 block size 256 grid size (216, 5, 1) global compute_resid_pow with (216, 1, 32, 8) 15498 block size 256 grid size (216, 2, 1) global compute_resid_pow with (216, 1, 16, 4) 15498 block size 256 grid size (216, 1, 1) global compute_resid_pow with (216, 1, 8, 4) 15498 block size 128 grid size (216, 8, 1) global compute_resid_pow with (216, 1, 19, 21) 15498 block size 256 grid size (216, 2, 1) global compute_resid_pow with (216, 1, 19, 21) 15498 block size 256 grid size (216, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12)========= sending heartbeat 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (215, 1, 2007, 81) 218 block size 256 grid size (215, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (215, 1, 672, 44) 894 block size 256 grid size (215, 42, 1) global compute_resid_pow with (215, 1, 224, 24) 3604 block size 256 grid size (215, 14, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (215, 1, 80, 12) 14456 block size 256 grid size (215, 5, 1) global compute_resid_pow with (215, 1, 32, 8) 15498 block size 256 grid size (215, 2, 1) global compute_resid_pow with (215, 1, 16, 4) 15498 block size 256 grid size (215, 1, 1) global compute_resid_pow with (215, 1, 8, 4) 15498 block size 128 grid size (215, 8, 1) global compute_resid_pow with (215, 1, 19, 21) 15498 block size 256 grid size (215, 2, 1) global compute_resid_pow with (215, 1, 19, 21) 15498 block size 256 grid size (215, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (216, 1, 2007, 81) 218 block size 256 grid size (216, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 15498 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15498 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15498 block size 256 grid size (500, 2, 1) global compute_resid_pow with (216, 1, 672, 44) 894 block size 256 grid size (216, 42, 1) global compute_resid_pow with (216, 1, 224, 24) 3604 block size 256 grid size (216, 14, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (216, 1, 80, 12) 14456 block size 256 grid size (216, 5, 1) global compute_resid_pow with (216, 1, 32, 8) 15498 block size 256 grid size (216, 2, 1) global compute_resid_pow with (216, 1, 16, 4) 15498 block size 256 grid size (216, 1, 1) global compute_resid_pow with (216, 1, 8, 4) 15498 block size 128 grid size (216, 8, 1) global compute_resid_pow with (216, 1, 19, 21) 15498 block size 256 grid size (216, 2, 1) global compute_resid_pow with (216, 1, 19, 21) 15498 block size 256 grid size (216, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: FSC No-Mask... ========= sending heartbeat 0.143 at 94.687 radwn. 0.5 at 77.547 radwn. Took 3.882s. FSC Spherical Mask... 0.143 at 98.051 radwn. 0.5 at 82.691 radwn. Took 3.793s. FSC Loose Mask... ========= sending heartbeat ========= sending heartbeat 0.143 at 102.718 radwn. 0.5 at 93.442 radwn. Took 14.691s. FSC Tight Mask... ========= sending heartbeat 0.143 at 109.158 radwn. 0.5 at 98.052 radwn. Took 11.333s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (116, 1, 2007, 81) 218 block size 256 grid size (116, 126, 1) global compute_resid_pow with (116, 1, 672, 44) 894 block size 256 grid size (116, 42, 1) global compute_resid_pow with (116, 1, 224, 24) 3604 block size 256 grid size (116, 14, 1) global compute_resid_pow with (116, 1, 80, 12) 14456 block size 256 grid size (116, 5, 1) global compute_resid_pow with (116, 1, 32, 8) 18720 block size 256 grid size (116, 2, 1) global compute_resid_pow with (116, 1, 16, 4) 18720 block size 256 grid size (116, 1, 1) global compute_resid_pow with (116, 1, 8, 4) 18720 block size 128 grid size (116, 8, 1) global compute_resid_pow with (116, 1, 19, 21) 18720 block size 256 grid size (116, 2, 1) global compute_resid_pow with (116, 1, 19, 21) 18720 block size 256 grid size (116, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12)========= sending heartbeat 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (116, 1, 2007, 81) 218 block size 256 grid size (116, 126, 1) global compute_resid_pow with (116, 1, 672, 44) 894 block size 256 grid size (116, 42, 1) global compute_resid_pow with (116, 1, 224, 24) 3604 block size 256 grid size (116, 14, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (116, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (116, 5, 1) global compute_resid_pow with (116, 1, 32, 8) 18720 block size 256 grid size (116, 2, 1) global compute_resid_pow with (116, 1, 16, 4) 18720 block size 256 grid size (116, 1, 1) global compute_resid_pow with (116, 1, 8, 4) 18720 block size 128 grid size (116, 8, 1) global compute_resid_pow with (116, 1, 19, 21) 18720 block size 256 grid size (116, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (116, 1, 19, 21) 18720 block size 256 grid size (116, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81)========= sending heartbeat 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (116, 1, 2007, 81) 218 block size 256 grid size (116, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (116, 1, 672, 44) 894 block size 256 grid size (116, 42, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (116, 1, 224, 24) 3604 block size 256 grid size (116, 14, 1) global compute_resid_pow with (116, 1, 80, 12) 14456 block size 256 grid size (116, 5, 1) global compute_resid_pow with (116, 1, 32, 8) 18720 block size 256 grid size (116, 2, 1) global compute_resid_pow with (116, 1, 16, 4) 18720 block size 256 grid size (116, 1, 1) global compute_resid_pow with (116, 1, 8, 4) 18720 block size 128 grid size (116, 8, 1) global compute_resid_pow with (116, 1, 19, 21) 18720 block size 256 grid size (116, 2, 1) global compute_resid_pow with (116, 1, 19, 21) 18720 block size 256 grid size (116, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4)========= sending heartbeat 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18720 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18720 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (116, 1, 2007, 81) 218 block size 256 grid size (116, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18720 block size 256 grid size (500, 2, 1) global compute_resid_pow with (116, 1, 672, 44) 894 block size 256 grid size (116, 42, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (116, 1, 224, 24) 3604 block size 256 grid size (116, 14, 1) global compute_resid_pow with (116, 1, 80, 12) 14456 block size 256 grid size (116, 5, 1) global compute_resid_pow with (116, 1, 32, 8) 18720 block size 256 grid size (116, 2, 1) global compute_resid_pow with (116, 1, 16, 4) 18720 block size 256 grid size (116, 1, 1) global compute_resid_pow with (116, 1, 8, 4) 18720 block size 128 grid size (116, 8, 1) global compute_resid_pow with (116, 1, 19, 21) 18720 block size 256 grid size (116, 2, 1) global compute_resid_pow with (116, 1, 19, 21) 18720 block size 256 grid size (116, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: FSC No-Mask... ========= sending heartbeat 0.143 at 96.398 radwn. 0.5 at 78.242 radwn. Took 2.103s. FSC Spherical Mask... 0.143 at 99.296 radwn. 0.5 at 86.924 radwn. Took 2.999s. FSC Loose Mask... ========= sending heartbeat ========= sending heartbeat 0.143 at 105.268 radwn. 0.5 at 96.313 radwn. Took 18.602s. FSC Tight Mask... ========= sending heartbeat ========= sending heartbeat 0.143 at 111.897 radwn. 0.5 at 100.377 radwn. Took 14.419s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 2007, 81) 218 block size 256 grid size (462, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 672, 44) 894 block size 256 grid size (462, 42, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (462, 1, 224, 24) 3604 block size 256 grid size (462, 14, 1) global compute_resid_pow with (462, 1, 80, 12) 14456 block size 256 grid size (462, 5, 1) global compute_resid_pow with (462, 1, 32, 8) 19672 block size 256 grid size (462, 2, 1) global compute_resid_pow with (462, 1, 16, 4) 19672 block size 256 grid size (462, 1, 1) global compute_resid_pow with (462, 1, 8, 4) 19672 block size 128 grid size (462, 8, 1) global compute_resid_pow with (462, 1, 19, 21) 19672 block size 256 grid size (462, 2, 1) global compute_resid_pow with (462, 1, 19, 21) 19672 block size 256 grid size (462, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 2007, 81) 218 block size 256 grid size (462, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (462, 1, 672, 44) 894 block size 256 grid size (462, 42, 1) global compute_resid_pow with (462, 1, 224, 24) 3604 block size 256 grid size (462, 14, 1) global compute_resid_pow with (462, 1, 80, 12) 14456 block size 256 grid size (462, 5, 1) global compute_resid_pow with (462, 1, 32, 8) 19672 block size 256 grid size (462, 2, 1) global compute_resid_pow with (462, 1, 16, 4) 19672 block size 256 grid size (462, 1, 1) global compute_resid_pow with (462, 1, 8, 4) 19672 block size 128 grid size (462, 8, 1) global compute_resid_pow with (462, 1, 19, 21) 19672 block size 256 grid size (462, 2, 1) global compute_resid_pow with (462, 1, 19, 21) 19672 block size 256 grid size (462, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4)========= sending heartbeat 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 2007, 81) 218 block size 256 grid size (462, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 672, 44) 894 block size 256 grid size (462, 42, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (462, 1, 224, 24) 3604 block size 256 grid size (462, 14, 1) global compute_resid_pow with (462, 1, 80, 12) 14456 block size 256 grid size (462, 5, 1) global compute_resid_pow with (462, 1, 32, 8) 19672 block size 256 grid size (462, 2, 1) global compute_resid_pow with (462, 1, 16, 4) 19672 block size 256 grid size (462, 1, 1) global compute_resid_pow with (462, 1, 8, 4) 19672 block size 128 grid size (462, 8, 1) global compute_resid_pow with (462, 1, 19, 21) 19672 block size 256 grid size (462, 2, 1) global compute_resid_pow with (462, 1, 19, 21) 19672 block size 256 grid size (462, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81)========= sending heartbeat 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size========= sending heartbeat 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size ========= sending heartbeat ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (462, 1, 2007, 81) 218 block size 256 grid size (462, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 672, 44) 894 block size 256 grid size (462, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 19672 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 19672 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 224, 24) 3604 block size 256 grid size (462, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 19672 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 80, 12) 14456 block size 256 grid size (462, 5, 1) global compute_resid_pow with (462, 1, 32, 8) 19672 block size 256 grid size (462, 2, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (462, 1, 16, 4) 19672 block size 256 grid size (462, 1, 1) global compute_resid_pow with (462, 1, 8, 4) 19672 block size 128 grid size (462, 8, 1) global compute_resid_pow with (462, 1, 19, 21) 19672 block size 256 grid size (462, 2, 1) global compute_resid_pow with (462, 1, 19, 21) 19672 block size 256 grid size (462, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: FSC No-Mask... ========= sending heartbeat 0.143 at 98.293 radwn. 0.5 at 80.629 radwn. Took 7.452s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 101.393 radwn. 0.5 at 92.951 radwn. Took 5.656s. FSC Loose Mask... ========= sending heartbeat ========= sending heartbeat 0.143 at 109.720 radwn. 0.5 at 98.203 radwn. Took 18.852s. FSC Tight Mask... ========= sending heartbeat 0.143 at 114.378 radwn. 0.5 at 102.654 radwn. Took 14.376s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat 0.143 at 115.435 radwn. 0.5 at 102.670 radwn. Took 21.571s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size========= sending heartbeat (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (462, 1, 2007, 81) 218 block size 256 grid size (462, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 672, 44) 894 block size 256 grid size (462, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 224, 24) 3604 block size 256 grid size (462, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 80, 12) 14456 block size 256 grid size (462, 5, 1) global compute_resid_pow with (462, 1, 32, 8) 20952 block size 256 grid size (462, 2, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (462, 1, 16, 4) 20952 block size 256 grid size (462, 1, 1) global compute_resid_pow with (462, 1, 8, 4) 20952 block size 128 grid size (462, 8, 1) global compute_resid_pow with (462, 1, 19, 21) 20952 block size 256 grid size (462, 2, 1) global compute_resid_pow with (462, 1, 19, 21) 20952 block size 256 grid size (462, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (462, 1, 2007, 81) 218 block size 256 grid size (462, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 672, 44) 894 block size 256 grid size (462, 42, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (462, 1, 224, 24) 3604 block size 256 grid size (462, 14, 1) global compute_resid_pow with (462, 1, 80, 12) 14456 block size 256 grid size (462, 5, 1) global compute_resid_pow with (462, 1, 32, 8) 20952 block size 256 grid size (462, 2, 1) global compute_resid_pow with (462, 1, 16, 4) 20952 block size 256 grid size (462, 1, 1) global compute_resid_pow with (462, 1, 8, 4) 20952 block size 128 grid size (462, 8, 1) global compute_resid_pow with (462, 1, 19, 21) 20952 block size 256 grid size (462, 2, 1) global compute_resid_pow with (462, 1, 19, 21) 20952 block size 256 grid size (462, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4)========= sending heartbeat 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (462, 1, 2007, 81) 218 block size 256 grid size (462, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 672, 44) 894 block size 256 grid size (462, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 224, 24) 3604 block size 256 grid size (462, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 80, 12) 14456 block size 256 grid size (462, 5, 1) global compute_resid_pow with (462, 1, 32, 8) 20952 block size 256 grid size (462, 2, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (462, 1, 16, 4) 20952 block size 256 grid size (462, 1, 1) global compute_resid_pow with (462, 1, 8, 4) 20952 block size 128 grid size (462, 8, 1) global compute_resid_pow with (462, 1, 19, 21) 20952 block size 256 grid size (462, 2, 1) global compute_resid_pow with (462, 1, 19, 21) 20952 block size 256 grid size (462, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81)========= sending heartbeat 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size ========= sending heartbeat ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (462, 1, 2007, 81) 218 block size 256 grid size (462, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 672, 44) 894 block size 256 grid size (462, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20952 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20952 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 224, 24) 3604 block size 256 grid size (462, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20952 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 80, 12) 14456 block size 256 grid size (462, 5, 1) global compute_resid_pow with (462, 1, 32, 8) 20952 block size 256 grid size (462, 2, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (462, 1, 16, 4) 20952 block size 256 grid size (462, 1, 1) global compute_resid_pow with (462, 1, 8, 4) 20952 block size 128 grid size (462, 8, 1) global compute_resid_pow with (462, 1, 19, 21) 20952 block size 256 grid size (462, 2, 1) global compute_resid_pow with (462, 1, 19, 21) 20952 block size 256 grid size (462, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: FSC No-Mask... 0.143 at 98.404 radwn. 0.5 at 80.557 radwn. Took 2.210s. FSC Spherical Mask... 0.143 at 101.601 radwn. 0.5 at 93.500 radwn. Took 3.101s. FSC Loose Mask... ========= sending heartbeat ========= sending heartbeat 0.143 at 109.659 radwn. 0.5 at 98.407 radwn. Took 13.325s. FSC Tight Mask... ========= sending heartbeat 0.143 at 116.332 radwn. 0.5 at 102.784 radwn. Took 15.406s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat 0.143 at 115.840 radwn. 0.5 at 102.841 radwn. Took 36.122s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in cufft.Plan.__del__: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 2007, 81) 218 block size 256 grid size (462, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 672, 44) 894 block size 256 grid size (462, 42, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (462, 1, 224, 24) 3604 block size 256 grid size (462, 14, 1) global compute_resid_pow with (462, 1, 80, 12) 14456 block size 256 grid size (462, 5, 1) global compute_resid_pow with (462, 1, 32, 8) 21068 block size 256 grid size (462, 2, 1) global compute_resid_pow with (462, 1, 16, 4) 21068 block size 256 grid size (462, 1, 1) global compute_resid_pow with (462, 1, 8, 4) 21068 block size 128 grid size (462, 8, 1) global compute_resid_pow with (462, 1, 19, 21) 21068 block size 256 grid size (462, 2, 1) global compute_resid_pow with (462, 1, 19, 21) 21068 block size 256 grid size (462, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12)========= sending heartbeat 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (462, 1, 2007, 81) 218 block size 256 grid size (462, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 672, 44) 894 block size 256 grid size (462, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 224, 24) 3604 block size 256 grid size (462, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 80, 12) 14456 block size 256 grid size (462, 5, 1) global compute_resid_pow with (462, 1, 32, 8) 21068 block size 256 grid size (462, 2, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (462, 1, 16, 4) 21068 block size 256 grid size (462, 1, 1) global compute_resid_pow with (462, 1, 8, 4) 21068 block size 128 grid size (462, 8, 1) global compute_resid_pow with (462, 1, 19, 21) 21068 block size 256 grid size (462, 2, 1) global compute_resid_pow with (462, 1, 19, 21) 21068 block size 256 grid size (462, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (462, 1, 2007, 81) 218 block size 256 grid size (462, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 672, 44) 894 block size 256 grid size (462, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 224, 24) 3604 block size 256 grid size (462, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 80, 12) 14456 block size 256 grid size (462, 5, 1) global compute_resid_pow with (462, 1, 32, 8) 21068 block size 256 grid size (462, 2, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (462, 1, 16, 4) 21068 block size 256 grid size (462, 1, 1) global compute_resid_pow with (462, 1, 8, 4) 21068 block size 128 grid size (462, 8, 1) global compute_resid_pow with (462, 1, 19, 21) 21068 block size 256 grid size (462, 2, 1) global compute_resid_pow with (462, 1, 19, 21) 21068 block size 256 grid size (462, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size========= sending heartbeat 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (462, 1, 2007, 81) 218 block size 256 grid size (462, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 672, 44) 894 block size 256 grid size (462, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21068 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21068 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 224, 24) 3604 block size 256 grid size (462, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 21068 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 80, 12) 14456 block size 256 grid size (462, 5, 1) global compute_resid_pow with (462, 1, 32, 8) 21068 block size 256 grid size (462, 2, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (462, 1, 16, 4) 21068 block size 256 grid size (462, 1, 1) global compute_resid_pow with (462, 1, 8, 4) 21068 block size 128 grid size (462, 8, 1) global compute_resid_pow with (462, 1, 19, 21) 21068 block size 256 grid size (462, 2, 1) global compute_resid_pow with (462, 1, 19, 21) 21068 block size 256 grid size (462, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: FSC No-Mask... 0.143 at 99.123 radwn. 0.5 at 81.454 radwn. Took 2.006s. FSC Spherical Mask... 0.143 at 102.515 radwn. 0.5 at 94.725 radwn. Took 2.760s. FSC Loose Mask... ========= sending heartbeat 0.143 at 111.227 radwn. 0.5 at 99.257 radwn. Took 10.207s. FSC Tight Mask... ========= sending heartbeat 0.143 at 118.472 radwn. 0.5 at 104.080 radwn. Took 10.784s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat 0.143 at 118.421 radwn. 0.5 at 103.405 radwn. Took 22.976s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: invalid value encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: divide by zero encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: invalid value encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: invalid value encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in cufft.Plan.__del__: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81)========= sending heartbeat 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 2007, 81) 218 block size 256 grid size (462, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 672, 44) 894 block size 256 grid size (462, 42, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (462, 1, 224, 24) 3604 block size 256 grid size (462, 14, 1) global compute_resid_pow with (462, 1, 80, 12) 14456 block size 256 grid size (462, 5, 1) global compute_resid_pow with (462, 1, 32, 8) 22030 block size 256 grid size (462, 2, 1) global compute_resid_pow with (462, 1, 16, 4) 22030 block size 256 grid size (462, 1, 1) global compute_resid_pow with (462, 1, 8, 4) 22030 block size 128 grid size (462, 8, 1) global compute_resid_pow with (462, 1, 19, 21) 22030 block size 256 grid size (462, 2, 1) global compute_resid_pow with (462, 1, 19, 21) 22030 block size 256 grid size (462, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12)========= sending heartbeat 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (462, 1, 2007, 81) 218 block size 256 grid size (462, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (462, 1, 672, 44) 894 block size 256 grid size (462, 42, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 224, 24) 3604 block size 256 grid size (462, 14, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (462, 1, 80, 12) 14456 block size 256 grid size (462, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 32, 8) 22030 block size 256 grid size (462, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 16, 4) 22030 block size 256 grid size (462, 1, 1) global compute_resid_pow with (462, 1, 8, 4) 22030 block size 128 grid size (462, 8, 1) global compute_resid_pow with (462, 1, 19, 21) 22030 block size 256 grid size (462, 2, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (462, 1, 19, 21) 22030 block size 256 grid size (462, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (462, 1, 2007, 81) 218 block size 256 grid size (462, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 672, 44) 894 block size 256 grid size (462, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 224, 24) 3604 block size 256 grid size (462, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 80, 12) 14456 block size 256 grid size (462, 5, 1) global compute_resid_pow with (462, 1, 32, 8) 22030 block size 256 grid size (462, 2, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (462, 1, 16, 4) 22030 block size 256 grid size (462, 1, 1) global compute_resid_pow with (462, 1, 8, 4) 22030 block size 128 grid size (462, 8, 1) global compute_resid_pow with (462, 1, 19, 21) 22030 block size 256 grid size (462, 2, 1) global compute_resid_pow with (462, 1, 19, 21) 22030 block size 256 grid size (462, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81)========= sending heartbeat 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (462, 1, 2007, 81) 218 block size 256 grid size (462, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 672, 44) 894 block size 256 grid size (462, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22030 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22030 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 224, 24) 3604 block size 256 grid size (462, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 22030 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 80, 12) 14456 block size 256 grid size (462, 5, 1) global compute_resid_pow with (462, 1, 32, 8) 22030 block size 256 grid size (462, 2, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (462, 1, 16, 4) 22030 block size 256 grid size (462, 1, 1) global compute_resid_pow with (462, 1, 8, 4) 22030 block size 128 grid size (462, 8, 1) global compute_resid_pow with (462, 1, 19, 21) 22030 block size 256 grid size (462, 2, 1) global compute_resid_pow with (462, 1, 19, 21) 22030 block size 256 grid size (462, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: FSC No-Mask... 0.143 at 99.490 radwn. 0.5 at 81.882 radwn. Took 1.936s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 102.546 radwn. 0.5 at 95.132 radwn. Took 2.769s. FSC Loose Mask... ========= sending heartbeat 0.143 at 111.201 radwn. 0.5 at 99.634 radwn. Took 9.814s. FSC Tight Mask... ========= sending heartbeat 0.143 at 119.206 radwn. 0.5 at 104.131 radwn. Took 9.837s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat 0.143 at 118.852 radwn. 0.5 at 103.675 radwn. Took 26.078s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in cufft.Plan.__del__: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (462, 1, 2007, 81) 218 block size 256 grid size (462, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 672, 44) 894 block size 256 grid size (462, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 224, 24) 3604 block size 256 grid size (462, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 80, 12) 14456 block size 256 grid size (462, 5, 1) global compute_resid_pow with (462, 1, 32, 8) 22184 block size 256 grid size (462, 2, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (462, 1, 16, 4) 22184 block size 256 grid size (462, 1, 1) global compute_resid_pow with (462, 1, 8, 4) 22184 block size 128 grid size (462, 8, 1) global compute_resid_pow with (462, 1, 19, 21) 22184 block size 256 grid size (462, 2, 1) global compute_resid_pow with (462, 1, 19, 21) 22184 block size 256 grid size (462, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12)========= sending heartbeat 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (462, 1, 2007, 81) 218 block size 256 grid size (462, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (462, 1, 672, 44) 894 block size 256 grid size (462, 42, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 224, 24) 3604 block size 256 grid size (462, 14, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (462, 1, 80, 12) 14456 block size 256 grid size (462, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 32, 8) 22184 block size 256 grid size (462, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 16, 4) 22184 block size 256 grid size (462, 1, 1) global compute_resid_pow with (462, 1, 8, 4) 22184 block size 128 grid size (462, 8, 1) global compute_resid_pow with (462, 1, 19, 21) 22184 block size 256 grid size (462, 2, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (462, 1, 19, 21) 22184 block size 256 grid size (462, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (462, 1, 2007, 81) 218 block size 256 grid size (462, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (462, 1, 672, 44) 894 block size 256 grid size (462, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (462, 1, 224, 24) 3604 block size 256 grid size (462, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 80, 12) 14456 block size 256 grid size (462, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 32, 8) 22184 block size 256 grid size (462, 2, 1) global compute_resid_pow with (462, 1, 16, 4) 22184 block size 256 grid size (462, 1, 1) global compute_resid_pow with (462, 1, 8, 4) 22184 block size 128 grid size (462, 8, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (462, 1, 19, 21) 22184 block size 256 grid size (462, 2, 1) global compute_resid_pow with (462, 1, 19, 21) 22184 block size 256 grid size (462, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size========= sending heartbeat 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size========= sending heartbeat (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (462, 1, 2007, 81) 218 block size 256 grid size (462, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (462, 1, 672, 44) 894 block size 256 grid size (462, 42, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (462, 1, 224, 24) 3604 block size 256 grid size (462, 14, 1) global compute_resid_pow with (500, 1, 32, 8) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22184 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22184 block size 128 grid size (500, 8, 1) global compute_resid_pow with (462, 1, 80, 12) 14456 block size 256 grid size (462, 5, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 32, 8) 22184 block size 256 grid size (462, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22184 block size 256 grid size (500, 2, 1) global compute_resid_pow with (462, 1, 16, 4) 22184 block size 256 grid size (462, 1, 1) global compute_resid_pow with (462, 1, 8, 4) 22184 block size 128 grid size (462, 8, 1) global compute_resid_pow with (462, 1, 19, 21) 22184 block size 256 grid size (462, 2, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (462, 1, 19, 21) 22184 block size 256 grid size (462, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: FSC No-Mask... ========= sending heartbeat 0.143 at 99.831 radwn. 0.5 at 82.032 radwn. Took 2.168s. FSC Spherical Mask... 0.143 at 102.638 radwn. 0.5 at 95.372 radwn. Took 3.062s. FSC Loose Mask... ========= sending heartbeat 0.143 at 111.218 radwn. 0.5 at 99.802 radwn. Took 10.567s. FSC Tight Mask... ========= sending heartbeat 0.143 at 119.505 radwn. 0.5 at 104.033 radwn. Took 11.177s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat 0.143 at 119.440 radwn. 0.5 at 103.661 radwn. Took 32.733s. ---- Computing FSC with mask 2.00 to 6.00 FSC No-Mask... 0.143 at 99.831 radwn. 0.5 at 82.032 radwn. Took 1.917s. FSC Spherical Mask... 0.143 at 102.638 radwn. 0.5 at 95.372 radwn. Took 2.755s. FSC Loose Mask... ========= sending heartbeat 0.143 at 111.218 radwn. 0.5 at 99.802 radwn. Took 10.171s. FSC Tight Mask... ========= sending heartbeat 0.143 at 124.277 radwn. 0.5 at 108.915 radwn. Took 11.296s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat 0.143 at 121.514 radwn. 0.5 at 105.668 radwn. Took 30.944s. ---- Computing FSC with mask 2.25 to 7.00 FSC No-Mask... 0.143 at 99.831 radwn. 0.5 at 82.032 radwn. Took 1.972s. FSC Spherical Mask... 0.143 at 102.638 radwn. 0.5 at 95.372 radwn. Took 3.058s. FSC Loose Mask... ========= sending heartbeat 0.143 at 111.218 radwn. 0.5 at 99.802 radwn. Took 11.325s. FSC Tight Mask... ========= sending heartbeat ========= sending heartbeat 0.143 at 123.321 radwn. 0.5 at 108.103 radwn. Took 12.023s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat 0.143 at 121.807 radwn. 0.5 at 106.241 radwn. Took 30.232s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: *************************************************************** /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: invalid value encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: invalid value encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: divide by zero encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) ========= main process now complete. ========= monitor process now complete.